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IMMUNOGLOBULIN CONTROL REGION 
pleldof the Invention 

This Invention relates to a control region in human immunoglobulins, to 
parts thereof and to their use. 
Background of the Invention 

The human IgH locus contains nine functional C genes and two 
pseudogenes, arranged 5' Cu-C5-Cr3-Cr1-vCe-Ca1-vi/C7-CY2-CY4-Cs-Ca2 3', 
over a 350 kb region of chromosome 14 (1). Overlapping phage and cosmid 
clones established the C gene organisation but attempts to obtain the entire 
region on overlapping clones or a single yeast artificial chromosome (YAC)' have 
been unsuccessful. PCR-based approaches identified highly repetitive regions 
downstream of Ca1 and Ca2 which include virtually identical 3' enhancers made 
up from different numbers of short motifs (3, 4). Similar repetitiveness, leading to 
instability, was also assumed for the estimated 40-70 kb gap between C6 and 
CyS, which could not be doned to establish a C gene contig. Indirect results from 
transgenic mice lacking different regions 3' of Oji and C5 further suggested that 
this particular downstream region might be important for high ejq^ression and 
switching of IgH genes (5, 6). Analysis of recombination in the C5-Cy3 interval 
showed a lack of assodation between these genes, whic^i may indicate a 
potential hot spot for recombination (7, 8). The potential significance of this 
region is further supported by the finding that a large area between C5 and CyS is 
deleted in certain leukemias, which may be linked to a pathogenic mechanism 
active at an early stage of B-cei! development (9). 

In the mouse, the IgH locus has been completely cloned (10) and in DNA- 
binding assays a duster of matrix association regions (MARs) was found in the 
C5-Cy3 intron (11). Although the region was not extensively characterized by 
sequendng, the presence of long interspersed repetitive elements in the vidnity 
of MARs may lead to the high recombination observed in this region of the IgH 
locus. Probes derived from bactenophage dones covering the mouse CS-Cy3 
region failed to identify corresponding sequences in the human locus (9), 



During B lymphocyte development it is generally thought that 
transcriptional activation of frie IgH locus is regulated using two enhancer arrays 
which flank the constant region duster (12-14, reviewed in 15). These arrays, flie 
Em intron enhancer and the 3' enhancer downstream of Co, contain multiple sites 
for the binding of both tlssue-spedfic and ubiquitous trans-acting factors (13, 16). 
Enhancer-mediated ac^vation appears to ba controlled by the interaction of both 
negative and positive regulatory elements (17, 18). The E(j intron enhancer 
provides potential protein binding sites for several regulatory elements which are 
essential for lymphocyte diff^entiation, including E47, PU.1, Ets-1, TFE3, USF 
and Oct (19, 20). The 3'a enhancer shares some DNA sequence elements with 
the B\i enhancer but also has additional motifs for factors involved in 
transcriptional regulation (13, 16). 

Activation and sequential DNA rearrangement of the Ig loci are crudal 
steps in antibody expression and c/s-acting locus elements like enhancers, which 
accommodate various combinations of factor-binding sites, have been implicated 
in IgH locus recombination and transcription (17). Although important infonnation 
about enhancer core functions has been obtained from mutant mouse strains, 
these are pooriy understood processes as there appears to be no activity of 
eifrier enhancer in eariy B-cell development when IgH heavy chain 
rearrangement is initiated. For example, deletion of the heavy chain intron 
enhancer Em showed severe impairment of Vh to DJh rearrangement whilst the 
eariier D to Jh rearangement was much less affected (21). In chimeric mice, in 
which the C<x 3' enhancer was replaced by a mariner gene, isotype deficiency and 
impainnent of heavy chain class-switching were observed (18). 
Summary of the Invention 

The region between C5 and Cy3 of IgH Is unstable and may be a 
recombination hot spot; it has now been shown that 21 kb of the unstable region 
in the human IgH locus between C5 and Cy3 contains a highly clustered array of 
a large number of transcription f^ctor^bincHng motifs interspersed with repeat 
sequences, Transfection assays revealed transcription enhancement and 
silencing acti^4ty at the pre B-cel) stage and in transgenic mice strong enhancer 



function was identified in the bone mamDw, the primary site of B-cell 
difFerentiatlon. Flow cytometry analysis of early B-cell populations showed that 
this enhancer is already active at the pro/pre B-cell stage where DMA 
reanBngement is Initiated, The region accommodating E5-y3 may exert locus 
control function at an early developmental stage, which may be critical in normal 
and aberrant B-cell development 

In particular, a pBAC clone (pHulgH3'S-y3) has been isolated, that 
established a 52 kb distance between 05 and Cy3, Sequence analysis identified 
a high number of repeat elements, e)^laining the instability of ihe region, and ari 
unusually large accumulation of transcription factor-binding motifs, both for 
lymphocyte-specific and ubiquitous transcription activators (IKAROS, E47, Oct-1, 
USF, Myc/Max), and for factors wtiich may repress transcription (DeltaEFI, Gfi-1, 
E4BP4, C/EBPbeta). Functional analysis in reporter gene assays revealed the 
importance of the CS^CyS interval In lymphocyte differentiation, and identified 
independent regions capable of either enhancement or silencing of reporter gene 
expression, and interaction with the IgH intron enhancer E\l In transgenic mice, 
canying a constaict which links the p-giobin reporter to the novel 6--y3 intron 
enhancer {E5-y3), transgene transcription is exclusively found in bone marrow B- 
cells from the eariy stage when IgH rearrangement is initiated up to the 
successful completion of H and L locus recombination resulting in antibody 
expression. These findings suggest that the C8-Cy3 inten/al exerts regulatory 
control on Immunogiobulin gene activation and expression during eariy lymphoid 
development. 

According to a first aspect of the invention, the CS-CyS region is stable in 
an artificial chromosome or non^iuman animal. It can thus be used to produce a 
repertoire of human antibodies or antibody fragments in a non-human animal; the 
functional diaracteristics of the region, including effective enhancer sequences, 
can provide a wider and more usefiji repertoire than has previously been 
possible. 

According to a second aspect of the invention, a functional enhancer 
and/or silencer sequence within the CS-Cy3 region (of the human or non-human 



IgH iocus) can be used as such, e.g. in a heterologous construct, to enhance the 
expression of a heterologous gene. For this purpose, the repeat sequences that 
give the C5-Cy3 region its instability are not needed. 

Products of this Invention may have utility in therapy. The utility may be 
associated with a heterologous gene. 
Brief Description of the Drawings 

FIGURE 1. Location of the pBAC done pHuS'S-yS spanning the gap in the C5- 
Cy3 interval region of the human IgH locus. A, Restriction analysis shows that the 
48 kb Mlul fragment present in pHulgH3'6^3 (bottom) aligns with the 3' end of 
the HulgH YAC (middle, left) and the 5' end of Cosmid [g6 (middle, right), with 
the gap indicating the previously uncloned region of the C5-Cy3 interval (top). 
The hatched region in pHulsH3'5-/3 was sequenced and fragments were 
analysed in reporter gene assays. M, Mlul; X, Xhol; B, BamHI; H, Hfndlll; E, 
EcoRI. (▼) 400 bp EcoRI fragment of plasmid pM5-1-23; (•) y3 gene probe; (♦) 
500 bp Sspl-BamHI fragment from pHulgH3'6-y3. S. Hybridization of BamHI or 
HIndlll digested human sperm DMA and pHulgHS'S-yS DNA with the 500 bp Sspl- 
BamHl fragment derived from pHulgH3'a-y3 confirms that the isolated fragment 
linking the C5-Cy3 gap is in the correct g^mline configuration. 
FIGURE 2. Analysis of nucleotide sequence of pHulgH3'5-y3. A, Dot Plot of 
self-alignment of the 5' 21 kb of the doned region between CS and Cy3 staring at 
the 5' Mlui site (see Rg. 1 for restriction sites) using the MacVedor 6.01 program 
(Oxford Molecular, Oxford, UK), The location of the eleven Alu-repeat-like 
sequences are indicated as boxes below the matrix with arrows indicating the 
orientation of the repeat Three other regions of repetitive sequence are indicated 
by vertical an^ws. Region 1 comprises three consecutive Mstll-IIke elements. 
Region 2 comprises a region of 21 repeats of "CT" followed by 15 repeats of 
"AT\ Region 3 comprises several complete and fragmented AJu motifs, as shown 
in B. S, Detail of the Alu motifs and fragments in region 3. The orientation of the 
Alu motif, indicated by a shaded box followed by an unshaded box, is indicated 
by anrows. The combination of a shaded box followed by an unshaded box forms 



an Aiihrepeat-Hke element (large filled arrows). Separate unshaded boxes 
represent Incomplete Alu motifis in the orientation indicated. 
FIGURE 3, Location of transcription factor recognition motife. The sequence of 
the 5' 20696 bp of the cloned fragment was compared to the TRANSFAC 
database using Matinspector, with a score of 1.0 for core slmilarily and >0.9 for 
matrix similarity (41), Restriction sites are indicated for alignment (see Fig. 1); 
Mlul (IVI), BamHl (B) and HindlH (H). The potential transcription factor-binding 
$jtes are indicated by open boxes. Factors which may enhance transcription in 
lymphocytes include Ikaros (81, 82, 52), E47 (83), Oct-1 (84, 85), USF (45). and 
Myc/Max (51). Factors which may suppres? transcription include DeltaEFI (66), 
Gfl-1 (58), E4BP4 (54, 86. 87), and C/EBPbeta (88). 

FIGURE 4. Transcription enhancer activity of fragments obtained from the 
MluI-BamHi region of pHuIgH3'5-^. A, Location of the subdoned fragments 
(from position 4510, 5555, 5885, 6461 and 6884 down to position 7185) with the 
shaded region maridng the Mlul-BamHI fragment (1-B), position 1 to 7185. The 
subcloned fr^ments were analysed using ludferase reporter gene assays to 
detect enhancer activi^ fn B, murine pre BH:ells (3-1, blacl< bars) and human pre 
B-celis (NALM-6, striped bars) and C, murine plasma cells (IVIPC11, shaded 
bars) and human plasma cells (D6-75, lined bars). Experiments using murine Id- 
Si pre B-cells gave similariy positive results. No enhancer dependent activity was 
found for AH murine plasma cells, the human T lymphoblast cell line Jurkat or 
human KB fibroblasts (data not shown). Bars indicate luciferase activity for the 
different fragments: 451 0-B, 6655-B, 6885-B, 6461-B and 6884-B which are 
different length 3' dones obtained by exonuclease digest of the 1-B fragment 
subdoned in pGL3 (see Fig. 5}, The ludferase activities were nonnalised to the 
p-galactosidase activity of the co-transfected plasmid and expressed as fold- 
increase. Values are averages of four to six independent experiments with at 
least two different DNA preparations. The range of standard deviation is shown in 
Fig. 5. Values were also determined for pGL3 vector alone, for pGL3 containing 
the SV40 promoter, and for pGL3 containing human Ep. 



FIGURES. Transcriptional activity of various pHulgS'S-^S fragments in 
combination with E|j. A. Scliematic diagram of the ludferase reporter gene 
constmct with location of the insert fragments and position of Ep. S, pHulgH3'5- 
y3 map and location of the subcloned fragments (left). Transcriptional activity, 
with standard error indicated, of fragments 1 (Mlul-Hindlll). 2 (Miul-BamHI), 3 
(HIndlll) and 4 (BamHI) with (shaded box) and without Ep (white box) was 
detennlned in pre B-cells aid piasma cells. Assay details are described in Rg. 4 
and below. 

FIGURE 6. Transgene constmct, transgenic mouse Identification and 
transCTiption analysis. A, Stnjclure of the transgene linking E5-y3 (position S88S- 
71 85, see Fig, 4) and a human 3-globin reporter gene (31), S, Southem blot 
hybridization of Ncol digested tail DNA from transgenic founders canying the Ea- 
yS-p-globin transgene. The internal Ncol fragment Is -12 kb and head to tail 
integration produces a -3.2 kb band. The copy number (high copy number 
founder 3823, low copy number founder 3832) is reflected by the signal Intensity. 
C, Northern blot hybridization of different tissue samples as indicated (BM, bone 
marrow; PPs, Peyer's patches) with the p-globin transgene (pG) and an actin 
probe (33) as standard. A transgene specific hybridization signal for p-globin was 
only found for bone marrow RNA. Human blood and nonna! mouse spleen RNA 
served as a control. 

FIGURE?. Moleojlar characterization of B-cell subpopulatlons at various 
stages of development. A, Schematic presentation of B-cell development firom 
stem cell to antibody secreting plasma cell. DNA rearrangement steps for the 
heavy chain (D to Jh and Vh to DJh) and the light (L) chain (Vl to 4) leading to 
surface expression and secretion of Ig are indicated, L chain rearrangement Is 
preceded by expression of surrogate L chain (^LC) and gp13Q/gp50. Expression 
of the differentiation stage-spedfic surface mariners, B220, CD19, CD25, c-kit 
and CD43, is indicated by arrow bars. S, Flow cytometric isolation of crkit 0019" 
[1], okit CD19* [2], CD43* B22Cr [3], CD43* B220* [4], CD2S* CD19* [5] and 
IgM" igD" [6} bone man-ow B-cell populations. C, RT-PCR amplification of 



sorted cell populations 1-6, bone marrow (bm) and human red cefis (HRC) with a 
combination of p-globin (pG) and HPRT primers as indicated. A strong p-globin 
PGR product was fbund in 8220" cells from the pro/pra B-cell type up to the 
Immature B-cell. High copy number mice were used for the analysis shown but 
low copy number mice gave essentially the same result 
Description of Preferred Embodiments 

It will be evident to the skilled person that any suitable artificial 
chromosome can be used in the invention. Sucdi chromosomes include bacterial, 
yeast, eukaryotic and mammalian chromosomes. 

Similariy, it will be evident that any suitable non-human animal can be 
used for the purposes of producing human antibodies or heavy chains. Such 
animals include rodents, sheep, hc»^es, pigs, goats, rabbits, chickens and bovine 
animais. 

A chromosome or isolated polynucleotide according to the Invention may 
comprise all or part of the region tiiat has been identified as having functional 
properities. It will be evident to the skilled person that such functional properties 
may be characteristic of part only cf the whole region. Such parts or fragments 
can be identified by known means. Similariy, transfomiation and other 
techniques that may be required to produce products of the invention are known. 

The following description provides illustrative embodiments of the 
invention. It should not be construed as limiting. 
BAC library construction 

A size-selected library was constructed from human DMA using the 
pBeloBACll vector {22), A Mlul linker was added to the pBeloBACtl vector 
following linearisation with SphI in the multiple doning site. Human genomic DNA 
was prepared from the fibroblast ceil line KB (23) and digested to completion with 
MluL The fragments were size fractionated by Pulsed Field Gel Electrophoresis 
(PFGE) in 1% low melting temperature agarose (SeaPlaque FCM, USA) at 
12V/cm in 0.5 x TBE with a 40 sec pulse for 24 h at 3.5**C, Gel slices containing 
DNA fragments from -30-80 kb were excised, melted at 67X and digested for 1 
h at 40"C with 1 unit Gelase (Epicentre, USA) per 0.1 g of gel. Size-selected 
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DNA (100-200 ng) was iigated with dephosphorylated Miul restricted pBeloBACIi 
vector (25-50 ng) using 4 units T4 iigase (NEB, Canada). Ligation mixtures were 
dialysed against 3 ml TE and 1mM polyamines for 4 in at room temperature using 
IMIilipore filters 30000 NIVIWL (Millipore, USA). Aliquots of the dialysed ligation 
mixture (ipl) were used to traisform £ coli DHIOB by electroporatlon at 120 V, 
25 mF, and 100 Ohms (24). Transfbnned cells were incubated for 90 min at 37*0 
in SOC medium, shaiang at 250rpm, and plated on LB-agar containing 12.5 
]xgfm\ chloramphenicol, 50 pg/ml X-Gai, and 26 \iglml IPTG. Distlnguishabie 
white and blue colonies appeared after 24h, and about 5000 white clones were 
isolated and analysed by colony^lter hybridization with the 0.4 kb EcoRI 
fragment of pM6-1-23 (26) and human CyS (ref. 26 and see below). The insert 
size was determined by MIul digest and PFGE. 
H^ridtzatfon and sequendng 

For hjAjridizalion analysis, genomic DNA (10 |jg) or pBAC DNA (0.5 ng) 
was digested with the desired restilction en^e and separated on 0.8 % 
agarose gels in TAE. DNA was transferred to nylon membranes (Hybond-N, 
Amersham) by alkaline transfer €^d the biote were hybridized with oiigolabelled 
fMTobes at 65'C in Clwch Buffer. Hybridization probes were a 7.4 kb Hindlll 
fragment containing human y3 (26) and a 0.4 kb EcoRI ftagment obtained from a 
3.4 Kb BamHI fragment located -20 Kb *)wnstream of human C8 from the 3' end 
of the HulgH YAC, derived by plasmid rescue after digestion (done pM5-1-23, 
ref. 25). The 5.9 kb BamHI fragment of pHulgH3'8^ (see Fig. 1) was subcloned 
and a 0.5 kb Sspl-BamHl fragment was derived therefrom tor further mapping. 

Sequencing was performed using an ABI Model 373 automated 
sequencer. Sequence Infonmation was obtained uang vector-spedflc primers to 
analyse restriction fragments subcloned in pUC19 or overlapping fragmente 
obtained by exonudease III digestion of firagments (Erase<a-Base System, 
Prome^, UK), and also by custom oligonudeotides based on initial sequence 
infomiation. 



Cell lines and bansfsf^n assay 

The cell lines used for tiie reporter gene analysis were from frie Babraham 
Institute's collection or gift from collaborators and their origin is described in the 
American Tissue Culture Collection catalog. The human cell lines used were 
NALM-6 pre B-cells, DG-75 plasma cells, Jurkat T lymphoblast cells and KB 
fibroblasts (27 and references therein, 28, 29. 23). 

The mouse cell lines were 3-1 and 18-81 pre B-cells and the plasma cell 
lines MPC11 (producing lgG2b) and AH (producing IgM) (30). The cell lines were 
maintained in RPMI (Gibco, UK) supplemented with 10% FCS and 50 \M S- 
mercaptoethanol. 

Transcriptional activity of fragments from the pHulgS'fr^S 5' region was 
analysed with a luciferase reporto* assay system according to the manufecturer's 
protocol using a BioOrbit 1253 Luminometer (Promega, UK). The pGL3 vector 
ensures low background luciferase expression and this allowed the unambiguous 
measurement of enhancer function. The overlg¥)ping fragments firom the 
pHuIg3'5-/3 5' region (see Rgs. 1 and 5) tested were: 1) 10.7 kb Mlul-Hindlll; 2) 
7.2 kb Mlul-BamHI; 3) 8.5 kb Hindlll; and 4) 5.9 kb BamHl. These were Inserted 
5' of the SV40 promoter of the pGL3 reporter gene construct. The size of 
fragment 2 In the reporter gene construct was reduced by creating me necessary 
5' and 3' overhang by Mlul and Kpnl restriction and exonudease III treatment As 
a positive confrol, pGL3 control vector oontelning the SV40 enhancer was used. 

To test the interaction with oth«- IgH enhancers, the human B\i intron 
enhancer (25) was added 5' of an inserted fragment Pre B-cells, pro B-cells, 
Jurkat and KB fibroblasts (2.5x10^) were transfeded by electroporation using 7 
Mg pGL3 containing fragment 1 , 2, 3 or 4 with or without E\i and with 1 |49 pSV-O- 
gaiactosid^e control plasmid as an internal standard. Electroporation conditions 
were 960mF, 200 Ohms and 270-300V. Cells were harvested alter 20-22h 
incubation. Plasma cells (1-3x10^ were transfected with the above constmcts by 
lipofection wifli DOTAP (Boehringer. Mannheim. Germany) according to the 
manufacturer's protocol. The S-galactosidase assay was perfonned in 100 [A 
reaction buffer (0.1 M Na2HP04/NaH2P04 pH7.3; 1mM MgCl; 50 mM 
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H-mercaptoethanol; 1.33 mg/ml o-nitrophenyl-a-0-galactopyraio$tcie} for 30-120 
min. The reaction was stopped with 150 pi 1M NasCOa, and the conversion of 
substrate was measured in a Titertel< l^ultlscan MCC/340 at 410 nm. 
Transgene express/on anaiysis 

The 1.3 i<b E5^3 fragment (position 5885 to 7185 in pGL3) ol>tained by 
PCR was added to a 3 l<b human p-globin gene subioned in pUC12 (31) by biunt 
end ligation Into the Xbal site in the iinicer. A M.3 l<b Saci-Hindili DNA fj-agment 
con^ning ES^3 linked to the p-giobin reporter gene was gel purified and injected 
into the pronuclei of fertilised (C57BU6xCBA)F1 eggs at 1-2 pg/mi (32). 
Transgenic mice were obtained with high and low copy number verified by tail 
blot analysis with a transgene probe, RNA from different tissues was prepered as 
described by tiie manufiacturer using the RNAqueous Kit (Ambion, UK) or the 
FWAzoi B method (AMS Biotechnology, Oxford, UK) for bone mamow. Cell 
preparations were essentially free of ery^rocytes with the exception of liver cells 
where a low percentage remained. Hybridization i^iobes were the p-globin 
transgene and a 540 bp actin gene fragment obtained by PCR (33). For the 
isolation of B-cell subpopuiations by flow cytometry bone marrow cells were 
stained as described (34). Multicolor steiining was carried out with the following 
reagents in combinations ^own in Fig. 7B: PE (phycoerithrin)-conjugated anti- 
mouse CD26 (P3317. Sigma), PE-conjugated anti-mouse CD45R (B220) (P3667, 
Sigma), biotinylated anti-mouse IgM (No. 02082D, PharlVIingen), PE-conjugated 
anti-mouse o-kit (CD117) (No. 09995B, PharMingen), FITC-conjugated anti- 
mouse CD19 (No. 09654D, PharMingen), FITC-conjugated anti-mouse IgD (No. 
022140, PharMingen) and biotinylated ami-mouse CD43 (No, 018020, 
PharMingen). Binding of biotinylated antibody was developed with streptavldin 
quantum red (S2899, Sigma). The oligonudeotides for RT-PCR of p-globin and 
HPRT as a control have been described (31). RT-PCR v«s perfonmed vt^ tiie 
One-Step System (GibcoBRL) under the following conditions: 50"C for 30 mIn 
followed by 94'C for 2 min for cDNA synthesis, followed by 30 PCR cycles (15 
sec 92^, 30 sec 55'C, 30 sec 72*0) and 6 min at 72^ to complete the reaction. 
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7776 dl^ance between CSand C}3is52kb 

Southern blot analysis suggested tiiat the region Isetween C5 and Cr3 is 
40-70 kb in size (1 , 8). We analysed human sperm DI^ and the human fibroblast 
cell line KB (23) by digestion with rare cutting en^mes and hybridization with 
probes from the 3' end of the human IgH YAC and from human Cy3 (see Rg. 1). 
Southern blots of Mlul digests showed a possible region of ~50 kb capable of 
panning the gap between C5-Cr3 (data not shown). A size-selected bacterial 
artificial chromosome (BAG) library was constructed and hybridized with the 0.4 
kb EcoRI fragment of pM5-1-23, from the 3' end of the IgH YAC (25), and human 
Cy3 (26). Of eight dones, pHulgH3'5^, hybridized with both of these probes and 
Mlul rwtrictlon cfigestion followed by Pulsed Field Gel Electrophoresis (PFGE) 
identified a 48 kb insert. Comparison of the restriction patterns of pHulgH3'6-ir3 
with the HuIgH YAC (5) and coslg6 (35) revealed substantial homologies at the S* 
and 3' ends respectively, which suggested that pHulgH^S-^ spanned the gap 
between the C5 and CyS constant region genes (Fig. 1A). The 5' Mlul cloning site 
of pHulgH3'8-Y3 lies -4 kb 3' of the 5 membrane exon 2, a region not yet 
characterized by sequence analysis, whilst the 3' Mlul doning site is located in 
the hinge region of Cy3 (36). Alignment of restriction maps of pHulgH3'S-y3 with 
the HulgH YAC and cosIg6 identified a novel 11 kb region. 

To verify that this novel region is in the correct genomic configuration, 
different digests of human DNA and pHulgH3'5-y3 DMA were analysed in 
Southern blots using different hybridization probes. An example (Fig. 1B) shows 
that hybridization with tiie 0.5 kb Sspl-BamHl fragment from pHulgH3'6-y3 
identified a 5.9 kb BamHI fragment and a 8.5 kb Hindlll fragment both of which 
are present in human genomic DNA and pHulgH3'5-y3. The C5 proximal BamHI 
site and the Cy3 proximal Hindlll site of the hybridization fragments are also 
present on the IgH YAC and cos Ig6, respectively. These results confirm that the 
cloned region on pHulgH3'8-Y3 completes the C6-Cy3 sequence gap and 
establishes the distance between C5m2 and Cy3 as 52 kb. 



12 



The Cd-Cr3 interval region is highly repetitive 

The nucleotide sequence of the C5 proximal 5' region of pHulgH3'5-y3 
(EMBL no. to be added), indicated in Fig. 1 A Isy the hatched line, was determined 
by subdoning of overlapping fragments, exonuclease III digestion and primer 
Nvalking, Self-aiignment of this 21 kb region (Fig. 2A) identified many regions of 
similarity as Alu motrfs (37), with a pronounced cluster at the 3' end (Fig. 2B) that 
incorporated several residual fragments of Alu motifs as well as complete motile, 
in various orientations. In addition, we found transposon-like elements (LINE, 
LTR etc) in abundance and of different complexity (3^40); three 40 bp Mstll-llke 
repeats were arranged in a tandem configuration, which may highlight 
transposon activity, and a 76 bp region comprising 23 (CT) repeats was followed 
immediately by 15 (AT) repeats. The CT and AT repeats encode the amino adds 
pairs Leu/S^ and ile/Tyr independently of the reading frame. The presence of 
this large number of repetitive elements may be the reason for the instability of 
the locus observed during doning effbrts. 
A cluster of transcription factor recognition mo^ 

Potential binding factor recognition sites of regulatory proteins were 
identified by comparison of the pHuIgH3'5^3 sequence with the TRANSFAC 
transcriptlon^clor database using Matinspector (41, 42). As shown in Fig. 3, 
many binding motifs were represented very frequently despite the high score 
used for the database search, which was 1.0 for core similarity and 0.90 or 
greater for matrix similarity. The transcription fedors identified can operate either 
to enhance or suppress immunoglobulin transcription, and many are also present 
in the and 3'a enhancer regions (reviewed in ref. 43, 44). Binding sites were 
found for the activating proteins Ikaros, E47, Oct1, USF and Myc/Max (45-51), 
The Ikaros gene products belong to the group of zinc finger DN A binding proteins 
(52) and have a complex role in the earty stages of lymphocyte development with 
establishing the maintenance and differentiation of multipotent progenitors (53). 
in the group of repressor proteins, binding sites for DeltaEFI, Gfl-1, E4BP4 and 
C/EBPbeta were identified (54-57). The potential binding-site of the nuclear zinc 
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finger protein Gfi-1 (growth factor independence 1) has been identified In a large 
number of eukaryotic promoter-enhancers (58, 59), 

No similar large accumulation of different binding protein recognition sites 
vvas evident vi^en other regions of the immunoglobulin H and L chain introis, 
including loiown enhancer regions, were examined. The presence of such an 
extensive and varied cluster of transcription factor-binding motifs may suggest 
that the region between C5 and Cy3 exerts previously unrecognised ig locus 
control function. 

Pre S-ceff spe&fic enhancer activity ctomstream of CS 

To determine any funcfional significance of the C6-Cy3 interval in 
development and whether the region contained new ds-acting regulatory 
sequences we analysed transcriptional promoter activify of subdoned fragments 
from pHulgHS'S-yS using a luciferase reporter gene assay. The analysis identified 
a 7.2 kb Mlul-BamHI fragment from the 5' C8 proximal region which showed good 
enhancer activity in human and mouse pre B-cells (NALM-6 and 3-1) when 
placed 5' of the luciferase reporter, but did not promote ludferase acHvi^ in 
human or mouse plasma cells (Figs. 4 and 5). As other fragments did not exhibit 
B-cell specific enhancer activity we wondered if the failure to demonstrate 
enhancer activity in Ig2b produdng MPC11. in which the CS-CyS Intron regions 
are deleted, was a consequence of the IgH locus having undergone isotype 
switching In this cell line. This was not the case as IglVf produdng AH myeloma 
cells produced very similar results for all fragments (data not shown). In addition, 
the orientation of the subdoned fragments did not alter their functional activity 
and enhancer function of the Mlul-BamHI fragment in pre B-cells was maintained 
in both transcriptional orientations. 

To detennine the predse location of this novel igH 5-y3 intron enhancer 
we dissected the Mlul-BamHI fragment by Mlul and exonudease digestion whidi 
removed the 5' region and resulted in various smaller size fiTagments (Fig. 4A). In 
murine pre B-cells, enhancer activity of the whole region (position 1-7185) 
exceeds that of the Ep enhancer but the most dramatic increase, twice the 
activity of Ep, was obtained with a 1.3 kb fragment from position 5885 to 7185 
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(Rg. 4S). In comparison to mouse cells human pre B-cells showed 

marked reductions in transcriptional activation. This may simply be a reflection of 
reduced transfection effidenctes of the human cells but could also indicate 
conc^ed interaction of several functional motifs which are recognized differently 
in mouse and human pre B-cells. However, the strongest transcriptional 
activation in both human and mouse pre &<alls was obtained by the fi^gment 
from position 5885-7185. Comparable enhancer activities, dependent on the 
developmental stage of tfie cell rather than its species of origin, agrees with what 
has been found for other human Ig enhancers analysed in vitfo and in transgenic 
mice (31, 60, 61). No enhancer activity was obtained at the later developmental 
stage of tfje plasma cell (Fig. 4C). In addition, neither fibroblasts (as control cells) 
nor T lymphocytes showed any enhancer activity (data not shown). These results 
from reporter gene transffecHon assays define a novel region which we named 
E5-y3 with Strong ds-acSng enhancer ftjnction operative at the pre B-cell stage. 
A transaipSon silencer is tocatsd adjacent to ff^ enhancer 

The proximity of Em and E5^ led us to speculate about possible 
enhancw co-operation during B-cell development. For this we added tiw human 
Em intron enhancer to 4 separate but overlapping fragments of the 21 kb 
sequenced 5' region of pHulgH3'8-Y3 and measured transcriptional activity in 
iudferase reporter asssys (Fig. 5). It is Interesting to note that except in one 
combination (Em + fragment 1) enhancer activity remained by and larae as 
identified in single enhancer/fragment constmcts analysed in B-cell subsets. This 
suggests separate functions of Em and ES-r3 and tinat tiielr enhancer activities 
are not simply additive. Enhancer activity was maintained when Em was 
combined with fragments 2, 3 and 4, whilst the combination of Em with fragment 1 
completely abolished enhancer activity in pre B-cdls and significantly reduced 
enhancer activity in plasma cells (Fig. 5S). This suggests that fragment 1 
contains a B-cell specific transcription silencer which is likely to be located at the 
3' end of fragment 1. Fragment 4 shares a con^sponding region and this may 
explain why transcription levels are somewhat reduced in pre B-cells when 
coupled to Ep. Furthemiore, fragment 1 contains the silencer in dose proximity to 
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the E5-y3 enhancer identified on fragment 2. Thus, ttie laci< of enhancer adivily 
of fragment 1 can be explained by the presence of a strong B-cell specific 
silencer in this region. It emerged that furt^wr dissection of the fragments was 
ineffiBCtive, as ES^3 enhancer and silencer activity was reduced or abolished, 
and suggests that recognition motif combinations are essential. The results show 
that the 5'C5-Cr3 region accommodates a previously unidentified B-cell specific 
enhancer-silencer array which may Interact with Ejj to control Ig expression 
during developmental processes. However, the many varied sequence 
recognition motifs in the C5-Cy3 Inten/al, together with the coordinated enhancer 
Interaction identified, indicate a more complex artivity with perhaps other 
transcription modifiers. 

In transonic mice ES-y3 i$ active in the develof^ng B^l 

To determine in vNo spedfidfy of the novel enhancer we constructed a 
transgene composed of E5-r3 linked to human p-globin as a reporter gene (Rg. 
6/\). A hi^ and a low copy number transgenic mouse line with head to tail 
integration of the transgene was idwitified by Soumem Wot (Fig. 6S) and RNA 
was prepared from diffarent tissues for Northern analysis and RT-PCR. For tfie 
essays actin or HPRT transcripte were used as internal controls. In Northern 
hybridization we were surprised to find p-globin expression in the bone marrow 
and not in other B-llneage fissue (Rg. 6C) which suggests an essential role of 
E5-y3 in B-cell development We then used RT-PCR as a more sensitive method 
to analyse various B-lineage, T-lineage and non-lymphoid tissues. The result was 
the same, the driven transgene is silent in all non-lymphoid tissue as well 
as in the T-cell compartment, and extensive p-gl<*in specific transcription Is only 
found in bone marrow RNA. These results and the in vitro reporter gene assays 
suggest differentiation stage-spedfic activity of ES-yS from pro B-cell 
development (initiaton of DNA rearrangement) to mature B-cell (surface IgM) 
prior to migration from the bone marrow. 

To identify differentiation stage-specific cell populations in which ES-yS is 
active we used flow cytometry and RT-PCR (Fig. 7). The progressive stages in 
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B-cell development are well characterized by cell surface markers and DMA 
configuration (62). We isolated several distinct cell populations from the bone 
manw identified by staining with labelled antibodies specific for the B-cell 
markers B220 or CD19 in combination with specific antibodies for okit and CD43 
(pro B and pre B-l stage), CD25 (pre B-II stage) and IgM/lgD (immature to 
mature B-cell) (Fig. 7S). RT-PCR identified p-globin activity in B220* or CD19* B- 
cells frwn bone marrow up to the immature IgM^igD' population which has 
completed the DMA rearrangement process. Transgene activity may be irfiliated 
at the eaiiy pro B-ceil stage in o4dt 6220" cells where a faint p-globin specific 
PCR band (relative to the HPRT signal obtained in the same reaction) was found. 
As the number of o-/icfif*CD19' cells was very low this result may have been 
obtained ftom a smalt cross-contamination of the more abundant C019^ cell 
population. However, at the stage when DNA rearrangement, D to Jh joining, Is 
initiated in o-Aft*CD19* and CD43*B220* B-ceWs (63) the E3-r3 enhancer is 
active. H ciialn and largely L chain rearrangement is finalised at the immature B- 
cell stage and cells expressing surface IgM show that ES^S driven ^-gldbin 
expression is extinguished at this maturation stage. This indicates that the ES-yS 
enhancer is acfive at tlie eaiiesl DNA rearrangement event and silent when the 
rearrangement process (V-D-J for the H chain and V^J for a L chain) has been 
successfully completed. 

The preceding illustrative description leads to various condusions, of 
certain will now be discussed. 
Instability of tlw C&CyQ interval 

Use of the BAC cloning ^em, capable of maintaining highly repetitive 
l^e insert DNA (22. 64). enabled us to establish the distance between C5m2 
and Cy3 as 52 l<b. Southern analysis and alignment of pHulgHS'S-yS with the 3' 
end of the HulgH YAC (5) and the 5' end of coslge (35) confirmed the overlap 
and identified a novel region of 11 kb. The falure of previous attempts to done 
the CS-Ct3 contig in cosmids or YACs can now be fully explained by the 
®ctensive repetltiveness of the Intronic sequence. Such repetitive regions are 
prone to deletion through homologous recombination when doned in yeast (65). 
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It is esdmated that /Uu-)ike repeat sequences are represented on average every 
4 kb throughout tiie genome (66, 67). In the interval from C5m2 to Cy3, 11 Alu- 
like repeats were identified in a 15 kb region. In contrast, the number of Alu 
motifs identified in the 950 kb region con^ning the variable genes of the human 
IgH locus was mudn less than expected by random distribution (68). The 
presence of Alu repeat elements Immediately adjacent to proto-oncogene 
translocations has led to the suggestion that frequo^t Alu motifs may predi^se 
a region as a b<A spot for recombination (69, 70). Indeed, the MstlN9(e 
sequences, identified in tandem in the C5-C^ Interval, align with the consensus 
sequence of transposon-IIke elements in the human genome (39, 40). The 
instability of the region is flirttier supported by linkage data, which irKlicate a lack 
of assodation between CS and Cy3 (7) and that certain ieukemias have deletion 
boundaries in the C5-Cy3 r^ion (9). Thus, translocations near the enhancer- 
silencer array in 0ie interval may result In transcriptional alterations the 
rearranged genes or loci leading to a malignant phenofype of the cell subset 
where enhancer activity can be identified. Interestingly, chromosonal 
translocations in BurWtfs lymphoma vrfiwe omyo «qpresslon is deregulated by 
linkage to a known enhancer may represent tumors of a developmental st^ in 
which the enhancer is active. 
A Factor-Ending site cluster 

The firequency of potential transcription factor-binding sites in the 21 kb of 
the CS-Cy3 interval region sequenced was unexpected. Using the default scores 
for tile analysis, 0.75 fw core similarity and 0.85 for matrix similarity (41) 
identified too many motife to be useful. Our analysis used scores for the core 
similarity of 1 and matrix similarity of 0,90. This higher score allowed the 
unambiguous identification of multiple motifs for 9 transcription factor-binding 
sites, five of which are recognised by proteins ttiat have shown to irrerease 
transcription in lymphocytes (Ikaros. E47, Oct-1, USF and Myc/Max) and four 
motifs shown to be recognised by transcription silencer or repressor proteins 
(DeHaEFI. GfH, 64BP4 and C/EBPbeta). It has been shown tiiat the repressor 
proteins Delta-EFI and Gfi-1 Interact with elements of the Eu enhancer (59, 58). 
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However, it is not obvious how the lymphocyte-spedfic enhancer and repressor 
activity identffied by functiOTal analysis of C5-y3 fnt^al fragments relates to this 
accumulation of potential transcription factor-binding motifs. Core sequence 
motifs for factor-binding sites found in also occur frequentiy in this novel 
enhancer region and, Indeed, In the whole 21 Id) region analysed (see Fig. 3). 
That all 9 binding sites appear flpequentiy tiiroughout the E5-y3 interval makes It 
impossible to predict tine activity of a particular region solely based on tine 
nucleotide sequence. Furthemiore, besides short sequence motife no homology 
to draw conclusions alDOut functional similarity was found between the Mlul- 
Hindlll fragm^ accommodating E5-y3 and tine or 3'a enhancers (71-74). 
Similarly, tiie region responsible for flie transcription silencer activity could not be 
deduced firom sequence comparison. However because of the significance of 
the functional activity identified in the C5^3 interval it is unlikely that the motifs In 
tills cluster are randomly distributed. In addition, recognition sequences essential 
for suppressing the function of flie mouse 3' enhancer have been reported (30), 
but were not ewdent in tiie sequenced region. Sequence comparison of human 
E5-y3 witii tine available mouse 6^3 interval sequence did not allow Identification 
of an equivalent mouse enhancer, however, a reason for this could be the 
apparent sequence gaps. 

An essential role of the C&CyS interval in early lymphocyte development 

The location of the C5-Cy3 interval means tiiat it will be deleted after 
switching from Cm to other isotypes. This Implies tiiat the regulatory activity of 
this region must be important during eariy developmental steps, which is 
supported by our finding tiiat E8-y3 enhancer control Is operative at the pro/pre 
B-cell stage (see Fig. 7). Reporter gene assays suggest tiiat the CS-CyS interval 
region provides strong B-cell specific enhancer and repressor function. The 
strengtii of ES-yS, position 5885-7186, can be 2-fdd greater than Ep enhancer 
activity in pre B-cells, but unlike Ep and Ea3' the 8-y3 interval enhancer does not 
exhibit any activity in mature B-cells. Interestingly, ES-y3 activity is solely B-oell 
spedfic unlike tiiat of Ep which shows some activity in T-cells at the 
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developmental timing of rearrangement (75). At developmental stages where 
neither the nor the E3'a enhancer was particularly acSve by Itself, enhancer 
combinations identified synergistic transcriptional activity (76). A different picture 
emerged when C5*73 inten^af fragments in combination with were transfeded 
(Fig, 5). Here individual enhancer function remained cell type-specific rather than 
synergistically increased; In ES^S^-Ep constmcts the activity equalled that of E5- 
y3 in pre B-cells whilst Em levels were obtained in plasma cells. The identified 
silencer extinguished both enhancers at the pre B-celf stage v\^Ich suggests a 
complex regulatory funcA'on of elements in the E5-y3 Interval which appear to 
coordinate stage-epedfic H chain activation. When linked to a p-globrn gene (that 
does not contain any cell-type^edfrc intragenic regulatory elements active in 
hematopoietic cells) and ass^ed in transgenic mice, E5^3 is a B-celi specffic 
transcription enhancer element active during B-cell development in the bone 
marrow. This pattem of activity complements the transcriptional activities of Ep 
and E3'a, contributory to V to DJ joining, switching and H chain expression (18, 
21, 77, 78), and may suggest a rote of ES-y3 in heavy chain activation and/or 
initiation of DNA rearrangement A view that the presence of several enhancers 
in the Ig loti simply reflects redundancy cannot be supported with these results 
which put the functional activity of the IgH enhancers in a possible order. Starting 
with the earilest; E5-r3 may be important for H chain activation vwth initiation of 
DNA recombination; Ep may then complete the joining process to allow H chain 
expression; E|j may also be involved in facilitating isotype swiching; Ea3' may 
complete the switching process, which also deletes Ep and E5-y3, and may have 
a role in influencing expression. 

The high frequent of transcription factor-binding sites and the funcfional 
activity of the CS-CyS intron, which includes an enhancer-silencer array, is 
reminiscent of the locus control regions (LCRs) described for the globin locus 
(79, 80). It is characteristic of LCRs that they are essential for the correct 
activation of a locus to ensure physiological expression levels are obtained. In 
transgenic mice carrying part of the human IgH locus, high level expression of 
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the transganes independent of copy number and integration site was not 
achieved (5), This suggests that neither the Em nor the 3' enhancer region is 
sufficient in itself to ensure the full ac^'vity of a single copy translocus. In 
transgenic mice, a translocus that includes a complete CS-CrS interval may allow 
correct chromatin opening and gene activatloa Identification of an equivalent 
region in the mouse and its removal by a knock-out approadi may shed fijrther 
light on IgH locus activation and the DNA recombination processes which are sfill 
poorly understood. 

Construction and expression analysis of the human IgH YAC have been 
described in Nicholson etal, J. Immunol., 163, 6898-6906, 1999. Techniques to 
manipulate YACs by extension have been described by Popov etal. Gene, 177. 
195-201, 1996. 

The extension of the hulgH YAC by adding a (y) 3 constant (C) region 
allows the study of isotype switching. We used two strategies to obtain a human 
IgHGS YAC: 1} addition to the y3 region on a cosmid to the hulgH YAC and 2) 
cloning of the authentic ^50 region between C5 and Cy3 in pBAC and 
elongating the hulgHG3 cosmid YAC by addition of this region. This produced an 
-300 kb human IgH nagion containing the authentic germ line region from VH2-5 
to downstream of CyS. 

in detail, the experiments were done in the following way: 
1) hulgH YAC-contalning yeast ceils were co^ransfected with BssHli 
linearised coslg6 (the Cy3 containing cosmid produced by Flanagan and 
Rabbltts, Nature, 2QQ, 709-713, 1983) and a non-centromeric YAC arm 
replaceent containing ADE2 as selectable marker (Markie e^ a/, Somat Cell. Mol. 
Genet, 12,161*169, 199). This allowed tandem integration and selection of 
modified YACs. lgG3 YACs wem initially kientified by PCR and further analysed 
by normal gel electrophoresis and pulsed field gel electrophoresis on Southern 
blots. IgH yeast clones containing the complete y3 cosmid region in correct 
transcriptional orientation, integrated in the predicted way, were used in 
protoplast fusion of yeast and ES cells* The resulting huigHGS E8 cell clones 
were further analysed for gene content Four different hulgHGS clones modified 
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as shovwi in Fig. 8 were used to produce chimeric mice and also resulted in gem 
line transmission mice. Expression of human lgG3 was found in ail four founders 
but the levels identified in ELISA assays were relatively low (ng/ml). Breeding 
into the ^MT background (a mouse strain with silenced mouse heavy chain 
locus, Kitamura et a/, Nature, 350. 423^26, 1991) may give more efficient 
human IgG3 expression. 

2) The human lgHG3 YAC with the integrated cosmid DNA contains a 
deletion of vital regulatory sequences Identified on a pBAC clone which covers 
the rather unstable region between C5 and Cy3 (see above). In order to produce 
a YAC with the authentic region from the VH cluster downstream of the y 
constant region genes we co-transfbrmed the cosmid containing r3 YAC with the 
Mlul linearised pBAC and the HISS gene as a selectable marker (l^ishiwaid et al, 
Uol Gen. Genet.. 208, 159-167, 1987). The HISS gene was subdoned into 
URA3, a gene present on the non-centromeric YAC ann. Both genes/regions, 
pBAC and HISS In URA3, allow homologous Integration into the YAC and oon^ct 
integration was Identified by PCR and Southern blot (Fig, 9). YAC-ES cell 
protoplast fusion was canled out and the resulting clones were analysed by 
Southern blot and PCR- Two clones were used to obtain chimeric mice which 
may be bred to obtain genm line transmission. 
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